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Background of the Invention 

The invention relates generally to nucleic acids and polypeptides encoded thereby, and 
methods of using these nucleic acids and polypeptides. 

1 0 S UMM ARY OF THE INVENTION 

The invention is based in part on the discovery of nucleic acids that include open reading 
frames encoding novel polypeptides* and on the polypeptides encoded thereby. The nucleic 
acids and polypeptides are collectively referred to herein as "ORFX". 

Accordingly, in one aspect, the invention provides an isolated nucleic acid molecule 
15 (SEQ ID NO:2w-l, wherein n is an integer between 1-3161), that encodes novel polypeptide, or a 
fragment, homolog, analog or derivative thereof. The nucleic acid can include, e.g., a nucleic 
acid sequence encoding a polypeptide at least 85% identical to a polypeptide comprising the 
amino acid sequences of SEQ ID NO:2w, wherein n is an integer between 1-3161. The nucleic 
acid can be, e.g., a genomic DNA fragment, or a cDNA molecule. 
20 Also included in the invention is a vector containing one or more of the nucleic acids 

described herein, and a cell containing the vectors or nucleic acids described herein. 

The invention is also directed to host cells transformed with a recombinant expression 
vector comprising any of the nucleic acid molecules described above. 

In another aspect, the invention includes a pharmaceutical composition that includes an 
25 ORFX nucleic acid and a pharmaceutical^ acceptable carrier or diluent 

1 
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In a further aspect, the invention includes a substantially purified ORF polypeptide, e.g., 
any of the ORFX polypeptides encoded by an ORFX nucleic acid, and fragments, homologs, 
analogs, and derivatives thereof. The invention also includes a pharmaceutical composition that 
includes a ORFX polypeptide and a pharmaceutical^ acceptable carrier or diluent. 
5 In a still a further aspect, the invention provides an antibody that binds specifically to an 

ORFX polypeptide. The antibody can be, e.g., a monoclonal or polyclonal antibody, and 
fragments, homologs, analogs, and derivatives thereof. The invention also includes a 
pharmaceutical composition including ORFX antibody and a pharmaceutically acceptable carrier 
or diluent. The invention is also directed to isolated antibodies that bind to an epitope on a 
10 polypeptide encoded by any of the nucleic acid molecules described above. 

The invention also includes kits comprising any of the pharmaceutical compositions 
described above. 

The invention further provides a method for producing an ORFX polypeptide by 
providing a cell containing a ORFX nucleic acid, e.g., a vector that includes a ORFX nucleic 
15 acid, and culturing the cell under conditions sufficient to express the ORFX polypeptide encoded 
by the nucleic acid. The expressed ORFX polypeptide is then recovered from the cell. 
Preferably, the cell produces little or no endogenous ORFX polypeptide. The cell can be, e.g. y a 
prokaryotic cell or eukaryotic cell. 

The invention is also directed to methods of identifying an ORFX polypeptide or nucleic 
20 acids in a sample by contacting the sample with a compound that specifically binds to the 
polypeptide or nucleic acid, and detecting complex formation, if present. 

The invention further provides methods of identifying a compound that modulates the 
activity of a ORFX polypeptide by contacting ORFX polypeptide with a compound and 
determining whether the ORFX polypeptide activity is modified. 
25 The invention is also directed to compounds that modulate ORFX polypeptide activity 

identified by contacting a ORFX polypeptide with the compound and determining whether the 
compound modifies activity of the ORFX polypeptide, binds to the ORFX polypeptide, or binds 
to a nucleic acid molecule encoding a ORFX polypeptide. 

In a another aspect, the invention provides a method of determining the presence of or 
30 predisposition of an ORFX-associated disorder in a subject. The method includes providing a 
sample from the subject and measuring the amount of ORFX polypeptide in the subject sample. 
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The amount of ORFX polypeptide in the subject sample is then compared to the amount of 
ORFX polypeptide in a control sample. An alteration in the amount of ORFX polypeptide in the 
subject protein sample relative to the amount of ORFX polypeptide in the control protein sample 
indicates the subject has a tissue proliferation-associated condition. A control sample is 
5 preferably taken from a matched individual, f.e., an individual of similar age, sex, or other 

general condition but who is not suspected of having a tissue proliferation-associated condition. 
Alternatively, the control sample may be taken from the subject at a time when the subject is not 
suspected of having a tissue proliferation-associated disorder. In some embodiments, the ORFX 
is detected using a ORFX antibody. 

10 In a further aspect, the invention provides a method of determining the presence of or 

predisposition of an ORFX-associated disorder in a subject The method includes providing a 
nucleic acid sample, e.g., RNA or DNA, or both, from the subject and measuring the amount of 
the ORFX nucleic acid in the subject nucleic acid sample. The amount of ORFX nucleic acid 
sample in the subject nucleic acid is then compared to the amount of an ORFX nucleic acid in a 

1 5 control sample. An alteration in the amount of ORFX nucleic acid in the sample relative to the 
amount of ORFX in the control sample indicates the subject has a tissue proliferation-associated 
disorder. 

In a still further aspect, the invention provides method of treating or preventing or 
delaying a ORFX-associated disorder. The method includes administering to a subject in which 

20 such treatment or prevention or delay is desired a ORFX nucleic acid, a ORFX polypeptide, or 
an ORFX antibody in an amount sufficient to treat, prevent, or delay a tissue proliferation- 
associated disorder in the subject. 

Unless otherwise defined, all technical and scientific terms used herein have the same 
meaning as commonly understood by one of ordinary skill in the art to which this invention 

25 belongs. Although methods and materials similar or equivalent to those described herein can be 
used in the practice or testing of the present invention, suitable methods and materials are 
described below. All publications, patent applications, patents, and other references mentioned 
herein are incorporated by reference in their entirety. In the case of conflict, the present 
specification, including definitions, will control. In addition, the materials, methods, and 

30 examples are illustrative only and not intended to be limiting. 
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Other features and advantages of the invention will be apparent from the following 
detailed description and claims. 

Detailed Description Of The Invention 

The invention provides novel polypeptides and nucleotides encoded thereby. The 
5 polynucleotides and their encoded polypeptides can be grouped according to the functions played 
by their gene products. Such functions include, structural proteins, proteins from which 
associated with metabolic pathways fatty acid metabolism, glycolysis, intermediary metabolism, 
calcium metabolism, proteases, and amino acid metabolism, etc. 

Included in the invention are 3161 novel nucleic acid sequences and their encoded 
10 polypeptides. The sequences are collectively referred to as "ORFX nucleic acids" or ORFX 
polynucleotides" and the corresponding encoded polypeptide is referred to as a "ORFX 
polypeptide" or ORFX protein". For example, an ORFX nucleic acid according to the invention 
is a nucleic acid including an ORF1 nucleic acid, and an ORF polypeptide according to the 
invention is a polypeptide that includes the amino acid sequence of an ORF1 polypeptide. 
15 Unless indicated otherwise, "ORFX" is meant to refer to any of the ORF 1-3 161 sequences 
disclosed herein. 

Table 1 provides a summary of the ORFX nucleic acids and their encoded polypeptides 
are summarized in Table 1 . Nucleic acid sequences and polypeptide sequences for ORFX 
nucleic acids according to the invention is provided in the section of the specification entitled 
20 "Disclosed Sequences of ORFX Nucleic Acid and Polypeptide Sequences." 

Column 1 of Table 1, entitled "ORF #", denotes an ORF number assigned to a nucleic 
acid containing an open reading frame according to the invention. 

Column 2 of Table 1, entitled "Internal Identification number (Nucleic Acid Sequence 
Identification Number, Polypeptide Sequence Identification Number), provides an internal 
25 identification number for the indicated ORF, along with sequence identification numbers (SEQ 
ID NOs.) corresponding to the indicated ORF. In general, for an ORFw according to the 
invention (wherein n is any integer from 1 to 3 161), a nucleic acid corresponding to the ORF is 
SEQ ID NO:2/?-l, and an amino acid sequence encoded by the ORF is SEQ ID NO:2«. For 
example, a nucleic acid sequence corresponding to an ORF1 nucleic acid is SEQ ID NO:l, and a 
30 polypeptide sequence corresponding to an ORF1 polypeptide is SEQ ID NO:2. Similarly, a 
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nucleic acid sequence corresponding to an ORF4 nucleic acid is SEQ ID NO:7, and a 
polypeptide sequence corresponding to an ORF4 polypeptide is SEQ ID NO:8; a nucleic acid 
sequence corresponding to an ORF198 nucleic acid sequence is SEQ ID NO:395, and a 
polypeptide sequence corresponding to an ORF198 polypeptide is SEQ ID NO:396. Nucleic acid 
5 sequences and polypeptide sequences for ORFX nucleic acids according to the invention are 

provided in the section of the specification entitled "Disclosed Sequences of ORFX Nucleic Acid 
and Polypeptide Sequences." 

Column 2 of Table 1, entitled "Protein Similarity", lists previously described proteins 
that are related to polypeptides encoded by the ORFs. Genbank identifiers for the previously 

10 described proteins are provided. These can be retrieved from http://www .ncbi.nlm.nih.gov/. 

To determine similarity to previously described proteins, polypeptides encoded by ORFX 
DNA sequences were tested using the Framesearch Algorithm against a nonredundant version of 
the GenPept Database from NCBI/Genbank. DNA sequences that had a score of W or above 
(Framesearch algorithm score, Edelman et. ah GCG Genetics) to a known protein were selected. 

1 5 Open reading frames were extended beyond the region of the protein matched using standard 
DNA translation and codon tables. Novel proteins that lacked a protein match were translated 
against the standard genetic codons and proteins with an ORF at least 80 amino acids and 
containing a Methionine start are included in the Table. 

Column 3 of Table 3, entitled "Protein Domains", lists previously described protein 

20 domains, designated by pfam entries, that are present in polypeptides encoded by the ORFs, 

Also included in column 3 are proteins in which these domains are present. The pfam entries can 
be retrieved from http://pfam.wustl.edu/. DNA sequences were translated in all six frames and 
tested using the Hmmer Algorithm against the Pfam Database (References to the algorithm and 
Pfam database can be found at http://pfam.wustl.eduy Translated DNA sequences that 

25 matched a protein domain entry in the Pfam database AND had a score of 7.5* were selected. 

Column 4 of Table 3, entitled "Protein Classification", lists the type of classification 
assigned for the protein, based on its homology. Examples of proteins in the classification 
include the following proteins: 
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Amylases 

Amylase is responsible for endohydrolysis of 1 ,4-alpha-glucosidic linkages in 
oligosaccharides and polysaccharides. Variations in amylase gene may be indicative of delayed 
maturation and of various amylase producing neoplasms and carcinomas. 

5 Amyloid 

The serum amyloid A (SAA) proteins comprise a family of vertebrate proteins that 
associate predominantly with high density lipoproteins (HDL). The synthesis of certain members 
of the family is greatly increased in inflammation. Prolonged elevation of plasma SAA levels, as 
in chronic inflammation, 1 5 results in a pathological condition, called amyloidosis, which affects 

10 the liver, kidney and spleen and which is characterized by the highly insoluble accumulation of 
SAA in these tissues. Amyloid selectively inhibits insulin-stimulated glucose utilization and 
glycogen deposition in muscle, while not affecting adipocyte glucose metabolism. Deposition of 
fibrillar amyloid proteins intraneuronally, as neurofibrillary tangles, extracellularly, as plaques 
and in blood vessels, is characteristic of both Alzheimer's disease and aged Down's syndrome. 

15 Amyloid deposition is also associated with type II diabetes mellitus. 

Angiopoeitin 

Members of the angiopoeitin/flbrinogen family have been shown to stimulate the 
generation of new blood vessels, inhibit the generation of new blood vessels, and perform several 
roles in blood clotting. This generation of new blood vessels, called angiogenesis, is also an 
20 essential step in tumor growth in order for the tumor to get the blood supply it needs to expand. 
Variation in these genes may be predictive of any form of heart disease, numerous blood clotting 
disorders, stroke, hypertension and predisposition to tumor formation and metastasis. In 
particular, these variants may be predictive of the response to various antihypertensive drugs and 
chemotherapeutic and anti-tumor agents. 

23 Apoptosis-related proteins 

Active cell suicide (apoptosis) is induced by events such as growth factor withdrawal and 
toxins. It is controlled by regulators, which have either an inhibitory effect on programmed cell 
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death (anti-apoptotic) or block the protective effect of inhibitors (pro-apoptotic). Many viruses 
have found a way of countering defensive apoptosis by encoding their own anti-apoptosis genes 
preventing their target-cells from dying too soon. Variants of apoptosis related genes may be 
useful in formulation of anti-aging drugs. 

Cadherin, Cyclin, Polymerase, Oncogenes, Histones, Kinases 

Members of the cell division/cell cycle pathways such as cyclins, many transcription 
factors and kinases, DNA polymerases, histones, helicases and other oncogenes play a critical 
role in carcinogenesis where the uncontrolled proliferation of cells leads to tumor formation and 
eventually metastasis. Variation in these genes may be predictive of predisposition to any form 
of cancer, from increased risk of tumor formation to increased rate of metastasis. In particular, 
these variants may be predictive of the response to various chemotherapeutic and anti-tumor 
agents. 

Colony-stimulating factor-related proteins 

Granulocyte/macrophage colony-stimulating factors are cytokines that act in 
hematopoiesis by controlling the production, differentiation, and function of 2 related white cell 
populations of the blood, the granulocytes and the monocytes-macrophages. 

Complement-related proteins 

Complement proteins are immune associated cytotoxic agents, acting in a chain reaction 
to exterminate target cells to that were opsonized (primed) with antibodies, by forming a 
membrane attack complex (MAC). The mechanism of killing is by opening pores in the target 
cell membrane. Variations in 20 complement genes or their inhibitors are associated with many 
autoimmune disorders. Modified serum levels of complement products cause edemas of various 
tissues, lupus (SLE), vasculitis, glomerulonephritis, renal failure, hemolytic anemia, 
thrombocytopenia, and arthritis. They interfere with mechanisms of ADCC (antibody dependent 
cell cytotoxicity), severely impair immune competence and reduce phagocytic ability. Variants 
of complement genes may also be indicative of type I diabetes mellitus, meningitis neurological 
disorders such as nemaline myopathy, neonatal hypotonia, muscular disorders such as congenital 
myopathy and other diseases. 
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Cytochrome 

The respiratory chain is a key biochemical pathway which is essential to all aerobic cells. 
There are five different cytochromes involved in the chain. These are heme bound proteins 
which serve as electron carriers. Modifications in these genes may be predictive of ataxia 
areflexia, dementia and myopathic and neuropathic changes in muscles. Also, association with 
various types of solid tumors. 

Kinesins 

Kinesins are tubulin molecular motors that function to transport organelles within cells 
and to move chromosomes along microtubules during cell division. Modifications of these genes 
may be indicative of neurological disorders such as Pick disease of the brain, tuberous sclerosis. 

Cytokines, Interferon, Interleukin 

Members of the cytokine families are known for their potent ability to stimulate cell 
growth and division even at low concentrations. Cytokines such as erythropoietin are 
cell-specific in their growth stimulation; erythropoietin is useful for the stimulation of the 
proliferation of erythroblasts. Variants in cytokines may be predictive for a wide variety of 
diseases, including cancer predisposition. 

G-protein coupled receptors 

G-protein coupled receptors (also called R7G) are an extensive group of hormones, 
neurotransmitters, odorants and light receptors which transduce extracellular signals by 
interaction with guanine nucleotide-binding (G) proteins. Alterations in genes coding for 
G-coupled proteins may be involved in and indicative of a vast number of physiological 
conditions. These include blood pressure regulation, renal dysfunctions, male infertility, 
dopamine associated cognitive, emotional, and endocrine functions, hypercalcemia, 
chondrodysplasia and osteoporosis, pseudohypoparathyroidism, growth retardation and 
dwarfism. 
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Tbioesterases 

Eukaryotic thiol proteases are a family of proteolytic enzymes which contain an active 
site cysteine. Catalysis proceeds through a thioester intermediate and is facilitated by a nearby 
histidine side chain; an asparagine completes the essential catalytic triad. Variants of thioester 
5 associated genes may be predictive of neuronal disorders and mental illnesses such as Ceroid 
Lipofuscinosis, Neuronal 1, Infantile, Santavuori disease and more. 



The key to the molecule type is as follows: 
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Abbrev: 


Title: 




amylase 


amylase protein 




amylaseinhib 


amylase inhibitor 




amyloid 


amyloid protein 


15 


apoptosis 


apoptosis associated protein 




apoptosisinhib 


apoptosis inhibitors 




apoptosisrecep 


apoptosis receptors 




ATPase_associated 


ATPase associated protein 




biotindep 


biotin dependent enzyme/protein 


20 


cadherin 


cadherin protein 




calcium_channel 


calcium channel protein 




carboxylase 


carboxylase protein 




cathepsin 


cathepsin/carboxypeptidases 




cathepsininhib 


cathepsin/carboxypeptidase inhibitor 


25 


chloride_channel 


chloride channel protein 




collagen 


collagen 




complement 


complement protein 




complementrecept 


complement receptor protein 




complementinhib 


complement inhibitor 


30 


csf 


colony stimulating factor 




csfrecept 


colony stimulating factor receptor 




cyclin 


cyclin protein 




cyto450 


cytochrome p450 protein 




cytochrome 


cytochrome related protein 


35 


deaminase 


deaminase 




dehydrogenase 


dehydrogenase 




desaturase 


desaturase 




dna_rna_bind 


DNA/RNA binding protein/factor 




dna_rna_inhib 


DNA/RNA binding protein/factor inhibitor 


40 


dynein 


dynein 
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elastase 

elastaseinhib 

eph 

esterase 
esteraseinhib 

fgf 

fgfreceptor 
gaba 

glucoamylase 

glucoronidase 

glycoprotein 

Guanylyl 

helicase 

histone 

HOM 

homeobox 

hydrolase 

hydroxysteroid 

hypoxanthine 

immunoglob 

immunoglobrecept 

interferon 

interieukin 

interleukinrecept 

isomerase 

isomeraseinhibitor 

isomerasereceptor 

kinase 

kinaseinhibitor 

kinasereceptor 

kinesin 

laminin 

lipase 

metallothionein 
MHC 

misc_channel 
ngf 

nuci_recpt 

nuclease 

oncogene 

oxidase 

oxygenase 

peptidase 

peroxidase 

phosphatase 

phosphataseinhib 



elastase 

elastase inhibitor 

EPH family of tyrosine kinases 

esterase 

esterase inhibitor 

fibroblast growth factor 

fibroblast growth factor receptor 

GABA receptor 

glucoamylase 

glucoronidase 

glycoprotein 

guanylylate cyclase 

helicase 

histone 

homologous 

homeobox protein 

hydrolase 

hydroxysteroid associated protein 

hypoxanthine associated protein 

immunoglobulin 

immunoglobulin receptor 

interferon 

interieukin 

interieukin receptor 

isomerase 

isomerase inhibitor 

isomerase receptor 

kinase 

kinase inhibitor 
kinase receptor 
kinesin 

laminin associated protein 
lipase 

metallothionein 

major histocompatability complex 

miscellaneous channel 

nerve growth factor 

nuclear receptor 

nuclease 

oncogene associated protein 

oxidase 

oxygenase 

peptidase 

peroxidase 

phosphatase 

phosphatase inhibitor 
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phosphorylase 
PIR 

polymerase 

potassium_channel 

prostaglandin 

protease 

proteaseinhib 

reductase 

ribosomalprot 

RTR 



SIM 
SPTR 



struct 

sulfotransferase 
SWP 

SWPN 

synthase 

tgf 

tgfreceptor 
thioesterase 
thiolase 
tm7 

tnf 
traffic 
tnfreceptor 
TRN 

transcriptfactor 

transferase 

transport 

tubulin 

ubiquitin 

unclassified 

water channel 



phosphorylase 

PIR DATABASE (release 56, 29-OCT- 
1998) 

polymerase 

potassium channel protein 

prostaglandin 

protease 

protease inhibitor 
reductase 

ribosomal associated protein 
EMBLDATABASE translated entries not to 
be incorporated into SWISS-PROT (20- 
JUL-1998) 
similar 

EMBL DATABASE translated entries to be 
incorporated into SWISS-PROT (20-JUL- 
1998) 

structural associated protein 
sulfotransferase 

SWISS-PROT DATABASE (release 1 8- 
OCT-1998) 

SWISS-PROT Update (release 1 l-NOV-98) 
synthase 

transforming growth factor 
transforming growth factor receptor 
thioesterase 
thiolase 

seven transmembrane domain G-protein 

coupled receptor 

necrosis factor receptor 

tumor necrosis factor 

tumor trafficking associated protein 

EMBL DATABASE translated entries 

update (20-JUL-1998) 

transcription factor 

transferase 

transport protein 

tubulin 

ubiquitin 

Protein not categorized into one of the 
aforementioned protein families 
water channel protein 
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Column 5 of Table 1, entitled, "Cells or Tissues in Which Gene is Expressed", denotes tissues, 
represented by five digit numbers, in which RNA homologous to the ORF nucleic acid 
sequences is present. Tissues or cells corresponding to the numbers are provided in Table 2. 

ORFX nucleic acids, and their encoded polypeptides, according to the invention are 
useful in a variety of applications and contexts. For example, various ORFX nucleic acids and 
polypeptides according to the invention are useful, inter alia, as novel members of the protein 
families indicated in Table 1, and/or according to the presence of domains and sequence 
relatedness to previously described proteins as summarized in Table 1 . 

ORFX nucleic acids and polypeptides according to the invention can also be used to 
identify cell types listed in Table 1 for an indicated ORFX according to the invention. 
Additional utilities for ORFX nucleic acids and polypeptides according to the invention are 
disclosed herein. 

ORFX Nucleic Acids 

The novel nucleic acids of the invention include those that encode an ORFX or ORFX- 
like protein, or biologically active portions thereof. The nucleic acids include nucleic acids 
encoding polypeptides that include the amino acid sequence of one or more of SEQ ID NO:2w, 
wherein /?=lto3161. The encoded polypeptides can thus include, e.g., the amino acid 
sequences of SEQ ID NO: 2, 4, 6, 8, 10, . . 6310, 6312, 6314, 6316, 6318, 6320, and/or 6322. 

In some embodiments, a nucleic acid encoding a polypeptide having the amino acid 
sequence of one or more of SEQ ID NO:2« (wherein n = 1 to 3161) includes the nucleic acid 
sequence of any of SEQ ID NO:2w-l (wherein n - 1 to 3161), or a fragment thereof. 
Additionally, the invention includes mutant or variant nucleic acids of any of SEQ ID NO:2/?-l 
(wherein n = 1 to 3 1 61), or a fragment thereof, any of whose bases may be changed from the 
disclosed sequence while still encoding a protein that maintains its ORFX -like activities and 
physiological functions. The invention further includes the complement of the nucleic acid 
sequence of any of SEQ ID NO:2n-l (wherein n = 1 to 3161), including fragments, derivatives, 
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20 25 30 



Ser 


Thr 


Leu Asp 


Gly .Ala Ala Ala Arg 


Ala 


Phe Tyr 


Glu 


Ala 


Leu 


He 






35 










40 








45 








Gly 


Asp 


Glu 


Ser 


Ser 


Ala 


Pro 


Asp 


Ser 


Gin 


Arg Ser 


Gin 


Thr 


Glu 


Pro 




50 










55 








60 










Ala 


Arg 


Glu 


Arg 


Lys 


Arg 


Lys 


i*ys 


Arg 


Arg 


He Met 


Lys 


Ala 


Pro 


Ala 


65 










70 
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Glu 
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Val 


Ala 


Glu Gly Ala 


Ser 


Gly Arg His Gly Gin 


Gly Arg 










85 
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Ser 
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Glu 


Ala 


Glu 


Asp 


Lys 


Met 
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Arg He 


Leu 


Arg 


Ala 


Ala 








100 










105 








110 






Gin 


Glu 


Gly 


Asp 


Leu 


Pro 


Glu 


Leu Arg 


Arg 


Leu Leu 


Glu 


Pro 


His 


Glu 






115 










120 








125 








Ala 


Gly 


Gly 


Ala 


Gly 


Gly 


Asn 


He 


Asn 


Ala 


Arg Asp 


Ala 


Phe 


Trp 


Trp 
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135 








140 










Thr 


Pro 


Leu 


Met 


Cys 


Ala 


Ala 


Arg 


Ala 


Gly Gin Gly 


Ala 


Ala 


Val 


Ser 


145 










150 










155 
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Leu 
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Gly 


Arg 


Gly Ala 


Ala 


Trp 


val 


Gljy val 


Cys 


Glu 


Leu 


Ser 










165 










170 








175 




Gly 
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Asp 
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Ala Gly 
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Val 








180 










185 








190 






Ala 
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Ser 


His 


Gly 
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Thr Arg 
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Pro 


Glu 
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200 








205 








Arg 


Ser 


Pro 


Thr 


Pro 


Ser 


Leu 


Gin 


Tyr 


Cys 


Glu Asn 


Cys 


Asp 


Thr 


His 




210 










215 








220 










Phe 


Gin 




Ser 


Asn 


His 


Arg 


Thr 


Ser 


Thr 


Ala His 


Leu 


Leu 


Ser 


Leu 


225 








230 










235 








240 


Ser 


Gin 


Gly 


Pro 


Gin 


Pro 


Pro 


Asn 


Leu 


Pro 


Leu Gly Val 


Pro 


lie 


Ser 
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Pro 


Gly 


Plie 


Lvs 


Leu 


Leu 


Leu 


Arg 


Gljy 


Gly Trp 


Glu 


Pro 


Gly 


Met 
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Leu 
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Pro 


Arg 
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Glu 


Gly Arg 


Ala 


Asn Pro 


lie 


Pro 


Thr 


Val 
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285 








Leu 


Lys 


Arg 


Asp 


Gin 


Glu 


Gly 


Leu 


Gly 


Tyr 


Arg Ser 


Ala 


Pro 


Gin 


Pro 




290 










295 
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Arg 


Val 


Thr 


His 


Phe 


Pro 
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Trp 


Asp 


Thr 


Arg Ala 


Val 


Ala 


Gly Arg 


305 










310 
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320 


Glu 


Arg 
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Pro 


Arg 


vaa 


Ala 


Thr 


Leu 


Ser 


Trp Arg 


Glu 


Glu 


Arcj 


Arg 










325 










330 
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Arg 


Glu 


Glu 


Lys 


ASp 


Arg 


Ala 


Trp 


Glu 


Arg 


Asp Leu 


Arg 


Thr 


Tyr 


Met 








340 










345 








350 






Asn 


Leu 


Glu 


Phe 

























355 

<210> 3585 

<211> 2782 

c2l2> DNA 

<213> Homo sapiens 

<400> 3585 

ncgcacgcgc agtcgtatcc gtgtgatggg cgggctgttg acggcgctgc gatggctgcc 
60 

tgcgagggca ggagaagcgg agctctcggt tcctctcagt cggacttcct gacgccgcca 
120 
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gtgggcggg^ ccccttgggc cgccgccacc 
180 

ccgcctcatc gggactccat ctcggtgacg 
240 

aagagttggc ggcggcgctc gtgctggagg 
300 

aatatgattc tcttcctcct tgcctttctg 
360 

ttggctgacc attggaaagc tctggctttc 
420 

gaaattgctg ggttaaaacc agcaaatcca 
480 

accgaccctg agaacttacc tgagatttcg 
540 

ggaccacctc acctgcagat tagaccccca 
600 

gaggccacaa aaaggcaaga agcccctgtg 
660 

acagtcatca gctggagggg agcggtgatc 
720 

agaagagcag aagtgcccac caagcctccc 
780 

gtgcatctga actatcgcca gaagggcgtg 
840 

taccgcaagt ttgcatgggg ccatgacgag 
900 

tggtttggcc tcggtctcac actgatcgac 
960 

aggaaagaat ttgaggaagc caggaagtgg 
1020 

gtggacgtca acctgtttga gagcacgatc 
1080 

cacctgtctg gggacagcct cttcctgagg 
1140 

cctgccttca gaacaccatc caagattcct 
1200 

gcccacccgc cacggtggac ctccgacagc 
1260 

gagttccggg agctctcccg tctcacaggg 
1320 

gtgacacagc acatccacgg cctgtctggg 
1380 

aatacccaca gtggcctctt cacccaccfcg 
1440 

agctactatg agtacctgct gaagcagtgg 
1500 

ctggaagact acgtggaagc catcgagggt 
1560 

cccagtaagc tcacctttgt gggggagctt 
1620 

cacctggtgt gcttcctgcc agggacgctg 
1680 

agccacatgg agctggccca ggagctcatg 
1740 



actgta_gfcca tgtacccacc jgccgccgccg 
ctgagctttg gcgagagcta tgacaacagc 
aaatggaagc aactgtcgag attgcagcgg 
cttttctgtg gactcctctt ctacatcaac 
aggctagagg aagagcagaa gatgaggcca 
cccgtcttac cagctcctca gaaggcggac 
tcacagaaga cacaaagaca catccagcgg 
agccaagacc tgaaggatgg gacccaggag 
gatccccgcc cggaaggaga tccgcagagg 
gagcctgagc agggcaccga gctcccttca 
ctgccaccgg ccaggacaca gggcacacca 
attgacgtct tcctgcatgc atggaaagga 
ctgaagcctg tgtccaggtc cttcagtgag 
gcgctggaca ccatgtggat cttgggtctg 
gtgtcgaaga agttacactt tgaaaaggac 
cgcatcctgg gggggctcct gagtgcctac 
aaagctgagg attttggaaa tcggctaatg 
tactcggatg tgaacatcgg tactggagtt 
actgtggccg aggtgaccag cattcagctg 
gataagaagt ttcaggaggc agtggagaag 
aagaaggatg ggctggtgcc catgttcatc 
ggcgtattca cgctgggcgc cagggccgac 
atccagggcg ggaagcagga gacacagctg 
gtcagaacgc acctgctgcg gcactccgag 
gcccacggcc gcttcagtgc caagatggac 
gctctgggcg tctaccacgg cctgcccgcc 
gagacttgtt accagatgaa ccggcagatg 
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gagacggggc tgagtcccga gatcgtgcac 
1800 

gacgtggagg tcaagccagc agacaggcac 
1860 

ctgttctacc tgtaccgcgt cacaggggac 
1920 

ctgcagagct tcagccgatt cacacgggtc 
1980 

gtccaggatc ctcagaagcc cgagcctagg 
2040 

acgctcaagt atctgttctt gctcttctcc 
2100 

tacgtgttca acaccgaagc ccaccctctg 
2160 

ctgctggtgt ggggacttcg ggtgggcaga 
2220 

agggcccacg tagcaccggc aaccgccaag 
2280 

ctcctcgtct ctgctttaat caggacaccg 
2340 

tgatgcgggg tgggctgggc cgctggagcc 
2400 

atgactcacg attgctgaag cctgagcagg 
2460 

aggtggtccc tggtactggg gtgaccgagt 
2520 

tcgtgaagcc tcagatgtcc ccaatccaag 
2580 

cctgaggctc c aggg ctggc tctg^tjgttt 
2640 

ccccgcaggg ggcttggagg gctggacggc 
2700 

ggaatgggtc ttttcggtgg agataaaagt 
2760 

aaaaaaaaaa aaaaaaaaaa aa 
2782 



ttcaaccttt 


acccccagcc 


gggccgtcgg 


aacctgctgc 


ggccagagac 


cgtggagagc 


cgcaaatacc 


aggactgggg 


ctgggagatt 


ccctcgggtg 


gctattcttc 


catcaacaat 


gacaagatgg 


agagcttctt 


cctgggggag 


gatgacccaa 


acctgctcag 


cctggacgcc 


cctatctgga 


cccctgccta 


gggtggatgg 


ggcaccttgc 


tgggtctgtg 


gcattttcca 


tggcccaggc 


tctgaactgg 


ctctgggctc 


tgaggacaag 


tgaggccgtc 


agtcttggtg 


tccgcctgct 


tcctccagaa 


gacacgaatc 


tctctgtggg 


ccgaccagag 


gggggcttcg 


ggacagccca 


gggtgcagct 


ctgcccgggc 


ggtctggagg 


ggctgccgtg 


actccagagg 


acaagctgga 


ctcagggatc 


ctcctgcjccg 


aagtccgtcc 


agctcacggg 


cccctccagt 


tgatttgctc 


taaaaaaaaa 


aaaaaaaaaa 



<210> 3586 

<211> 663 

<212> PRT 

<213> Homo sapiens 



<400> 3586 

Met Ty r pro pro Pro Pr o Pro Pro 

1 -5 
Thr Leu Ser Phe Gly Glu Ser Tyr 
20 

Arg Ser Cys Trp Arg Lys Trp Lys 

35 AX) 
Met lie Leu Phe Leu Leu Ala Phe 

50 55 
Tyr lie Asn Leu Ala Asp His Trp 
65 7-0 
Glu Glu Gin Lys Met Arg Pro Glu 



Pro His Arg Asp Phe lie Ser Val 

10 X3 
Asp Asn Ser Lys Ser Trp Arg Arg 
25 30 
Gin Leu Ser Arg Leu Gin Arg Asn 
45 

Leu Leu Phe Cys Gly Leu Leu Phe 
60 

Lys Ala Leu Ala Phe Arg Leu Glu 

75 BO 
lie Ala Gly Leu Lys Pro Ala Asn 
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Arg 


Lys 


Phe 


Ala 


Trp 
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Trp 


He 


Leu 
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Val 
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He 
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.9-5 
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Asp 
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Gin 


Arg 


Gly 








3-25 
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Asp 


Gly 






14 0 










Glu 


Ala 


Pro 


Val 


Asp 


Pro 


Arg 




-X55 










JL6U 


He 


Ser 
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Arg Gly Ala 


Val 


170 
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Ser 


Arg 


Arg 
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Glu 


Val 










ASjO 






Arg 


Thr 
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Pro 


Val 








205 








He 


Asp 


Val 


Phe 


Leu 


His 


Ala 






-220 










Gly 


His 


Asp 


Glu 


Leu 


Lys 


Pro 




235 










240 


Gly 


Leu 


Gly 


Leu 


Thr 


Leu 


He 


J2SX) 










335 




Gly 


Leu 


Arg 


Lys 


Glu 


Phe 


Glu 










270 






Leu 
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Glu 


Lys 


Asp 


Val 








.2-8 5 








Arg 


He 


Leu 


Gly 


Gly Leu 


Leu 
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Phe 
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Arg 


Lys 


Ala 


Glu 




315 










3^0 


Phe 


Arg 
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Ser 


Lys 


He 


330 










335 




Gly Val 
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Pro 


Arg 










350 






Val 


Thr 


Ser 


He 
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Leu 


Glu 
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Lys 


Phe 


Gin 


Glu 


Ala 






3-8-0 
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Ser 


Gly 


Lys 


Lys 


Asp 




395 










400 


His 


Ser 


Gly 


Leu 


Phe 


Thr 


His 


-410 










A J. 5 




Ala 


Asp 


Ser 


Tyr 


Tyr 


Glu 


Tyr 










430 






Lys 


Gin 


Glu 


Thr 


Gin 


Leu 


Leu 








445 








Val 


Arg 


Thr 


His 


Leu 


Leu 


Arg 






460 










Val 


Gly 


Glu 


Leu 


Ala 


His 


Gly 




475 










4-80 


Val 


Cys 


Phe 


Leu 


Pro 


Gly 


Thr 


490 










495 




Pro 


Ala 


Ser 


His 


Met 


Glu 


Leu 










-5XO 






Gin 


Met 


Asn 


Arg 


Gin 


Met 


Glu 
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515 






520 










525 








Thr 


Gly 


Leu Sex Pro 


GlU 


He 


Val 


His 


Phe 


Asn 


Leu 


Tyr 


Pro 


Gin 


Pro 




530 






535 










540 










Gly 


Arg 


Arg Asp Val 


Glu 


Val 


Lys 


Pro 


Ala 


Asp 


Arg 


His 


Asn 


Leu 


Leu 


545 






550 










555 










560 


Arg 


Pro 


Glu Xhr Val 


Glu 


Ser 


Leu 


Phe 


Tyr 


Leu 


Tyr 


Arg 


Val 


Thr 


Gly 






565 










570 










575 




Asp 


Arg 


Lys Tyr Gin 


Asp 


Trp 


Gly 


Trp 


Glu 


He 


Leu 


Gin 


Ser 


Phe 


Ser 






580 








585 










590 






Arg 


Phe 


Thr Arg Val 


Pro 


Ser 


Gly 


Gly 


Tyr 


ser 


Ser 


He 


Asn 


Asn 


Val 






595 






600 










605 








Gin 


Asp 


Pro Gin Lys 


Pro 


Glu 


Pro 


Arg Asp Lys 


Met 


Glu 


Ser 


Phe 


Phe 




€10 






615 










620 










Leu 


Gly 


Gin .Thr JLeu 


Lys 


Tyr 


Leu 


Phe 


Leu 


Leu 


Phe 


Ser 


Asp Asp 


Pro 


625 






630 










635 










640 


Asn 


Leu 


Leu J&Ar Xeu Asp 


Ala 


Tyr 


Val 


Phe 


Asn 


Thr 


Glu 


Ala 


His 


Pro 






645 










650 










655 





Leu Pro He .Trp Thr JPro Ala 
660 



<210> 3587 

<:211> 3148 

<212> DNA 

<213> Homo sapiens 

<400> 3587 

nctttttttt ttttttttga gtgtggggtc agtttattgg gcatgcgeca gtcagaggct 
60 

gggctggcca gggtcgggta gggcagcagt ttgtctggac cccgagaaac ccaactggaa 
120 

tccagggcct catctgcttc aaagccaaag tcttcctcaa ccttaatcxg caccggggcc 
180 

agctctggag tcagcgcatt tcctgctcgg cgtccatccc gtggcactcg ccgcctcttc 
240 

cgcccactgg gcccctcacc gggggctggg ctgccgggtt ctgggggtgc aggagtcctt 
300 

ct gST9 c SJ99g acagtgtctc tttctctgga ggctcattct ccgcattgcc tggggtgggg 
360 

gcatccgtgc cctggctgcc ctcatcctcc agcacaatgg tgaactggct ggcccggtag 
420 

tcatccccgt aggagtccag cactctcatg aggaacctcc gfctcctgctg cagcctccga 
480 

gttatcctct gcacctgatg gagcctgttc aggacccgct cgttcacctg ctcgatctcc 
540 

cggcagcgcc gacctagtgc ctggtacttt ctgcgattta attcccgctg gcgccgccgc 
600 

cgaccccggg ctgcctcttc ctcttcatct cgctcccgga gccctgagcc gcccagacca 
660 

cctgacacaa actccacttc cgtctccagc tcgctctcca ggatgtggcc accaaatagg 
720 

ggaggcaacg ccaactctga gcctggcggc gctgagaact cctcaaagcc cacggctgcc 
780 

atggtccctt ctctctgctc caattccatc tccgcgacct ccggaagccc cgggcctcag 
84 0 
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